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(57) Abstract 

Polypeptides comprising a collectin neck re- 
gion, or variant or derivative thereof or amino acid 
sequence having the same or a similar amino acid 
pattern and/or hydrophobicity profile, are able to 
trimerise. Such polypeptides may comprise addi- 
tional amino acids which may include heterologous 
amino acids, for example forming a protein domain 
or derived from an immunoglobulin or comprising 
an amino acid which may be derivatised for attach- 
ment of a non-peptide moiety such as oligosaccha- 
ride, and may form homotrimers or heterotrimers. 
Heterotrimerisation may be promoted by gentle 
heating, e.g. to about 50 °C, then cooling to room 
temperature. One use for the polypeptides is in 
seeding collagen formation. Nucleic acid encoding 
the polypeptides and methods of their production 
are provided. 
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TRIMERISING POLYPEPTIDES , THEIR MANUFACTURE AND USE 

The present invention relates to polypeptides able 
to form multimers, particularly trimers, and manufacture 
and use of such polypeptides. 
5 The biosynthesis of collagen molecules requires the 

correct alignment of three polypeptides consisting of 
Gly-Xaa-Yaa triplets to form the triple-helix [1] . Each 
chain assumes a left-handed helical structure in the 
right-handed triple-helix, which is stabilized by 

10 inter-chain hydrogen bonds. The formation of the triple 
helix proceeds from a single nucleation point at the 
C- terminal end of the three chains and grows in a 
zipper-like fashion [2] . 

Refolding experiments on collagen type III 

15 indicated that specific inter- chain disulphide- bridges 
formed between C- terminal globular protein structures, 
can be sufficient to function as a nucleus for the 
refolding of a triple -helix in vitro , whereas reduction 
abrogates this process completely [3] . However, the 

20 molecular mechanism guiding association and registered 
alignment of collagens has remained elusive since the 
family of proteins containing collagenous sequences is 
large and sequence comparison of the different types of 
C-terminal, non-collagen-like, regions did not reveal a 

25 common motif shared by FACITs (fibril associated 

collagens with interrupted triple-helix, types IX, XII, 
XIV, and XVI) , the collagens of striated fibrils (types 
I, II, III, V, and XI) , or the collagens with Clq-like 
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C-terminal domains (types VIII and X) [4] . The frequent 
formation : of inter-chain disulphide bonds has further 
complicated the search for protein modules involved in 
the inter- chain association and subsequent nucleation of 
5 triple -helix formation. 

The family of collagenous proteins known as the 
u collectins" is composed of the serum proteins 
mannan- binding protein (MBP) , collectin-43 and bovine 
conglutinin as well as the lung surfactant proteins SP-D 
10 and SP-A [5] . Collectin polypeptide chains contain a 
short N-terminal region, a collagen-like region (of 
between 20 and 59 Gly-Xaa-Yaa triplets) linked, by a 
short stretch of 34 - 39 amino acids (which form the 
*neck' region) to a .C-terminal, C-type lectin domain (of 
15 113-118 amino acids) (Figure la) . 

The present invention has resulted from results 
showing that the "neck-region" of collectin protein is 
able to mediate inter-chain recognition, trimerizat ion 
and registered alignment of three collagenous 
20 polypeptide chains [7] . The results make available 
simple means .of trimerising polypeptides of choice . 

According to the present invention there is 
provided a polypeptide comprising a neck-region of a 
collectin, or an amino acid sequence variant thereof or 
25 a derivative thereof. Such polypeptide will form a 

trimer under appropriate conditions. The polypeptide is 
non-naturally occurring, i.e. it is one not found in 
nature . 



WO 95/31540 



PCT/GB95/01104 



- 3 - 

It may comprise one or more heterologous amino 
acids joined to the neck- region or variant or derivative 
thereof . It may retain one or more amino acids from the 
molecule from which is it derived; for example the 
- 5 polypeptide may comprise a collectin C-type lectin 
domain . 

According to one definition, the present invention 
provides a non-naturally occurring polypeptide 
consisting essentially of amino acids according to the 
10 following formula: 
X-N-Y, 

wherein N is a collectin neck-region peptide or a 
variant or derivative thereof or a sequence of amino 
acids having an amino acid pattern and/or hydrophobic ity 
15 profile which is the same as or similar to that of a 
collecting neck-region, able to form a trimer; X is 
absent or one or more amino acids and Y is absent or one 
or more amino acids. If X and Y are both absent, the 
polypeptide consists essentially of N. X and/or Y may 
20 .comprise one or more heterologous amino acids, any of 
which may be derivatisable, or "chemically modifiable", 
for attachment of a chemical moiety. 

The chemical moiety may be introduced at a specific 
chemically modifiable residue or residues. A chemically 
25 modifiable amino acid residue is an amino acid residue 
susceptible to modification with a chosen chemical 
reagent under specified conditions. The amino acid may 
be unique in the polypeptide or it may be uniquely 
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modifiable, or selectively or preferentially modifiable 
over other amino acids present. For instance, a 
cysteine residue may be introduced into the binding site 
and be available for chemical modification via its thiol 
5 group. It may also be possible to render an amino acid 
preferentially modifiable compared with other amino 
acids of the same type within the molecule by 
engineering its environment, eg by positioning it within 
the molecule adjacent another amino acid with particular 

10 properties. For instance, an amino group next .to a 

carboxlate group would be rendered more nucleophilic and 
selectively modifiable even if not unique within the 
binding site. 

Other chemically modifiable amino acids include 

15 lysine, glutamate, histidine and tyrosine. 

Covalent modification allows a wide variety of 
moieties to be incorporated, particularly reporter 
groups or cof actors for catalysis. In one embodiment of 
the present invention, one or more amino acids which are 

2 0 specifically modifiable are incorporated. This allows 
the interaction of large organic groups such as the 
fluorescent reporter group, 7-nitrobenz-2-oxa-l , 3- 
diazole (NBD) . Other large groups such as the flavin 
cof actors for catalysis, FMN and FAD may be 

2 5 incorporated . 

There is also the possibility of incorporating two 
(or more) residues for modification with the same 
reagent or two (or more) different reagents, or more 
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preferably different residues may be modified with 
different reagents to incorporate different chemical 
moieties into the binding site. This is useful eg for 
catalysis where the presence of two chemical moieties 
5 such as flavin and haem may promote catalysis of a redox 
reaction. 

There are other possible ways of modifying a 
polypeptide. There are a number of amino acid residues 
which may be specifically derivatized using molecules 

10 containing specific functional groups. For instance, 
amino groups may be modified with N-hydroxysuccinimide 
esters, carboxyl groups with carbodiimides , histidines 
and cysteines with halomethyl ketones, arginine with 
glyoxals (see e.g. A.R. Fersht , Enzyme Structure and 

15 Mechanism 2nd -edn, 1985 pp248-251, W.H. Freeman, New 
York) . 

Some reagents which may be used to modify specific 
amino-acid residues are given by T. Imoto and H. Yamada 
in "Protein Function: a Practical Approach", pp247-277, 

20 1989. To introduce specific functional groups into 

polypeptides the reactive group of these reagents may be 
combined with the functional group in a modifying 
reagent. For instance, if it is desired to modify a 
protein with the fluorophore 7-amino-4-methylcoumarin-3 - 

25 acetic acid, the N-hydroxysuccinimidyl ester of the 

molecule may be used to modify amino groups, whereas N- 
[6- ( -amino-4-methylcoumarin-3-acetamido) hexyl] -3 ' - (2' - 
pyridyldithio) propionamide may be used to modify 

BNSDOCIO: <WO 9531 540A1 _l_> 
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cysteine groups . 

Another possible methodology is to use 
transglutaminase which catalyzes an acyl -transfer 
reaction between the gamma -carboxy amide group of 
5 glutamine residues and primary amines (E. Bendixen et 
al, J. Biol. Chem. 26821962-21967, 1993; K.N. Lee et al 
Biochim. Biophys. Acta 1202 1-6 1993; T. Kanaji et al J. 
Biol. Chem. 268 11565-11572 1993). This enzyme could 
therefore introduce amino acid residues from a peptide 
10 into a glutamine residue through a peptide lysine 

epsilon amino group or into a lysine group via a peptide 
glutamine group. The enzyme could also catalyse 
derivatization of glutamine residues with a primary 
amine . 

15 A further approach is to introduce chemical 

moieties to either the N or C terminus of a polypeptide 
using reverse proteolysis or chemical conjugation or a 
combination of the two (I. Fisch et al, Bioconj . Chem. 
3, 147-153, 1992; H. F. Gaertner et al, Bioconjug. Chem. 

20 3, 262-268, 1992; H. F. Gaertner et al, J. Biol. Chem. 
269, 7224-7230, 1994; J. Bongers et al, Biochim. 
Biophys. Acta, 50, S57-162, 1991; R. Offord, Protein 
Engineering, 4, 709-710, 1991). These methods have been 
used to introduce non-encoded elements to protein and 

25 peptide molecules. 

Examples of f luorophores which may be introduced 
are fluorescein, phycoerythrin, coumarin, NBD, Texas Red 
and chelated lanthanide ions. Examples of catalytic 
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groups which may be introduced are flavin adenine 
dinucleotide (FAD) , flavin mononucleotide (FMN) , 
cytochromes and chelated metal ions such as zinc and 
copper. 

5 In one embodiment the neck-region is that of the 

collect in SP-D (or a variant or derivative thereof) . 
Other possibilities include collectin-43 and 
conglutinin. Mannan- binding protein (MBP) and SP-A may 
also be useable in the present invention, though in that 

10 case additional amino acids from the respective lectin 
domains may be required. Since, however, those 
additional amino acids may have a-helical structure, as 
in the natural molecule, the sequence of amino acids 
required may still be considered to be the "neck region" 

15 of the collectin. 

Figure 1(a) shows the neck-region of SP-D, with V/L 
repeat. In certain embodiments it is preferred to 
include in the neck- region the immediately up stream G 
residue, and amino acids in between, and/or downstream 

2 0 linker, such as the one shown. A linker may be 

important for spacing folded domains in a trimer. For 
instance, "... FP..." may provide a kink in the chain. 

If not the neck-region of SP-D , the neck-region in 
a polypeptide according to the present invention may 

25 have an amino acid pattern and/or hydrophobicity profile 
the same as or similar (e.g. substantially the same as) 
to that of the neck-region of SP-D, provided the ability 
to trimerise is retained. 
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The following shows an alignment of amino acids in 
various collectin neck regions: 

position - - abcdef gabcdef gabcdef gabcdef 
human SP-D - VASLRQQVEALQGQVQHLQAAFSQYKK 

5 bovine SP-D - VNALRQRVGILEGQLQRLQNAFSQYKK 

rat SP-D - - SAALRQQMEALNGKLQRLEAAFSRYKK 

bovine conglutinin VNALKQRVT I LDGHLRRFQNAFSQYKK 
bovine collectin 43 VDTLRQRMRNLEGEVQRLQNIVTQYRK 

The positioning of V at layer 'a' and L at layer 
10 'd' was commonly believed to result preferably in 

dimers . The presence of F and Y in the above sequences 
is unusual and may have a direct influence on the degree 
of oligomerization. G is positioned exactly in the 
middle of the two 'ad' repeats. This may also be of 
15 importance for the trimerization process. Glycine 

residues behave slightly different than other residues 
in a-helices; however, no clear rules are established so 
far: G is often found at the end of helices, 
terminating them. Since residues of the 'a' and 'd' 
2 0 layers do not exactly come to be positioned on top of 
one another Cal' on top of ' a2' , 'dl' on top of 'd2') 
in the left-handed supercoil of the a-helical bundle, 
the central positioned glycine residue might be relevant 
for a slightly altered supercoil of the helices, and 
25 thus for a different packing behaviour or the 

hydrophobic residues at the 'a' and 'd' layers. This in 
turn may be part of the reason for the exclusive 
trimerization of the given sequence. The C-terminal 
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'ad' layers of rather large, bulky hydrophobic residues 
F and Y will require a different packing behaviour than 
the standard a-helical bundle, possibly also affecting 
the overall twist and geometry of the coiled-coil. The 
5 abundant presence of Q residues might also be important, 
since Q residues at 'e' and 'g' positions can contribue 
also to the forces holding the helices together, and 
this is emphasised by the substitutions seen from human 
SP-D to, for example, bovine collect in .43, where a Q to 
10 R substition at the 'g' position is paralleled by a Q to 
E substition at the 'e' layer of directly following ' a- 
g' repeat, thus providing for a ionic interaction with 
opposite charges at the positions of the Q residues in 
human SP-D. 

15 The neck-region peptide has therefore a number of 

distinct features which may be altered perhaps to 
influence the properties of the peptide to form a 
trimeric a-helical bundle. The alignment shows 
naturally occurring neck-regions of collectins which 

2 0 form trimers and which also show a number of 

substitutions, apparently not affecting the trimerizing 
capability. 

Trimerising features of the neck-region peptide may 
be further enhanced by lengthening of the "a"- n d" 
25 repeats, particularly at the N-terminal end, for 

instance by addition of another copy of the first part 
of the neck region: 

VASLRQQVE ALQGQ - VASLRQQVEALQGQVQHLQAAFSQYKK 
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For human administration purposes, preferably the 
neck- region and/or the heterologous sequence of amino 
acids are -human in origin, or u humanized" , in order to 
reduce the likelihood of there being an immune response 
5 generated upon administration of the polypeptide to an 
individual. The term "heterologous sequence of amino 
acids" refers to a chain of amino acids which is not 
found naturally joined to the collectin neck-region at 
the position of fusion in the polypeptide of the 

10 invention. 

Amino acids joined to the neck region (etc.) may 
form a protein domain. Preferably, the sequence of 
amino acids forms a functional domain. The amino acids 
may comprise a sequence derived from an immunoglobulin, 

15 eg variable domain, or variable domain and constant 
region. 

In principle, any amino acid sequence including a 
peptide or polypeptide, .independently folding protein 
domain or protein domains may be joined to the 

2 0 neck-region peptide. This may be at the C- terminal or 
the N- terminal end of the polypeptide or at both ends, 
involving identical, similar, or different protein 
sequences. " Joining" may involve use .of recombinant DNA 
technology to generate a fusion polypeptide or the use 

2 5 of chemical synthesis of a polypeptide including the 

neck-region (or a variant thereof or derivative thereof) 
or the chemical attachment of polypeptides to the 
neck-region peptide. 
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Any three identical or different polypeptides 
containing the neck- region may form homo- trimers or 
hetero- trimers under appropriate conditions. A 
homotrimer consists of three polypeptides which are the 
5 same. A heterotrimer consists of three polypeptides, at 
least two of which are different. All three 
polypeptides may be different. One, two or all three 
polypeptides in a heterotrimer may be a polypeptide 
according to the invention, provided each polypeptide 

10 has a region able to trimerise. 

The neck-region of-helical bundle generally exists 
only as a trimeric molecule in conditions which mimic or 
approximate physiological conditions.- The stability of 
the trimer may be enhanced by increasing the ionic 

15 strength of the solution. The trimeric association may 
be reversibly broken up by denaturation (e.g. heat 
denaturation) and the molecules can re-associate into 
trimers as soon as conditions return to physiological 
conditions (cooling). The neck-region peptide's ability 

2 0 to trimerize is independent of adjacent protein 
sequences. Using different polypeptides, each 
containing the neck-region, the reversible denaturation- 
reassociation process can be used to form heterotrimeric 
molecules. Preferably, the conditions for denaturation 

25 are chosen to prevent loss of any property of the 

adjacent heterologous protein domain. One method of 
producing heterotrimers is specified in Example 3 and 
involves heating (in this case to about 50 °C) and 
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cooling . 

The heterotrimerization using about 50 *C is only an 
example at a certain ionic strength; other means of 
heterotrimerizations may be preferable. The methods 
5 employed may vary depending on the application. The 
different neck-region peptide constructs can be 
chemically synthesized, modified, or generated in an 
expression system. A chemical attachment site can be as 
specific as a fusion-protein, since a. reactive group can 
10 be placed at a specific position at the N-terminal or C- 
terminal (or less likely, but possible, central) part of 
the neck-region peptide. The molecules attached at 
these sites can be peptides or organic compounds. 
Although the present invention is generally 
15 applicable, not all protein domains may be used with the 
. neck region equally well. Very large domains may 
require specially adapted linker sequences and, most 
importantly, domains which show dimerizing or 
oligomerizing properties can form large aggregates which 
20 could be entirely insoluble or otherwise unsuitable for 
the use they were intended for. Also, the neck- region 
containing peptides should preferable be purified 
without the 'use of organic solvents such as acetonitrile 
used in reversed-phase chromatography. If used, for 
2 5 example after chemical synthesis, these compounds should 
be thoroughly removed since they can interfere with the 
trimerization. In a similar way, the presence during 
(hetero-) trimerization of sodium dodecylsulphate or 
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similar, strong, ionic detergents should be avoided. 
(However, as these compounds disrupt the hydrophobic 
forces that hold the helices of the neck-region together 
they can, in a controlled way, be also useful reagents 
5 in the heterotrimerization at low temperatures.) 

An amino acid sequence variant may comprise one or 
more changes, e.g. by way of addition, substitution, 
insertion or deletion of one or more amino acids, 
compared with wild type. Any such change should not 

10 abolish the ability of the polypeptide to form a trimer, 
though it may increase or decrease this ability 
depending on the nature of the change. A derivative has 
some modification compared to the naturally-occurring 
neck-region, which may be chemical. This may include 

15 the chemical or enzymatical attachment of carbohydrate 
structures, nucleic acids, or other chemical compounds, 
especially those used as antigen or those used in other 
chemical or biological interactions, such as 
ligand- receptor interactions . 

20 Changes may be made to the amino acid sequence, 

compared with wild-type, by providing and manipulating 
suitable encoding nucleic acid used for the production 
of the polypeptide in an expression system. The present 
invention further provides nucleic acid comprising a 

2 5 sequence of nucleotides encoding a polypeptide able to 
form a trimer and comprising a neck-region of a 
collect in, an amino acid sequence variant thereof or 
derivative thereof, or a sequence of amino acids having 
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an amino acid pattern and/or hydrophobicity profile the 
same as or similar to the neck region of collectin SP-D, 
fused to a heterologous sequence of amino acids, as 
disclosed herein. 
5 The nucleic acid may comprise an appropriate 

regulatory sequence operably linked to the encoding 
sequence for expression of the polypeptide. Expression 
from the encoding sequence may be said to be under the 
control of the regulatory sequence. 

10 Also provided by the present invention are a vector 

comprising nucleic acid as set out above , particularly 
any expression vector from which the encoded polypeptide 
can be expressed under appropriate conditions , and a 
host cell containing any such vector or nucleic acid. 

15 A convenient way of producing a polypeptide 

according to the present invention is to express nucleic 
acid encoding it. Accordingly, the present invention 
also encompasses a method of making a polypeptide 
according to the present invention, the method 

2 0 comprising expression from nucleic acid encoding the 
polypeptide, either in vitro or in vivo. The nucleic 
acid may be part of an expression vector. Expression 
may conveniently be achieved by growing a host cell, 
containing appropriate nucleic acid, under conditions 

25 which cause or allow expression of the polypeptide. 

Systems for cloning and expression of a polypeptide 
in a variety of different host cells are well known. 
Suitable host cells include bacteria, mammalian cells, 
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yeast and baculovirus systems. Mammalian cell lines 
available in the art for expression of a heterologous 
polypeptide include Chinese hamster ovary cells, HeLa 
cells, baby hamster kidney cells and many others. A 
5 common, preferred bacterial host is E. coli. 

Suitable vectors can be chosen or constructed, 
containing appropriate regulatory sequences, including . 
promoter sequences, terminator fragments, 
polyadenylation sequences, enhancer sequences, marker 
10 genes and other sequences as appropriate. Vectors may 
be plasmids, viral e.g. 'phage, or phagemid, as 
appropriate. For further details see, for example, 
Molecular Cloning: a Laboratory Manual: 2nd edition, 
Sambrook et al . , 1989, Cold Spring Harbor. Laboratory 
15 Press. Many known techniques and protocols for 

manipulation of nucleic acid, for example in preparation 
of nucleic acid constructs, mutagenesis, sequencing, 
introduction of DNA into cells and gene expression, and 
analysis of proteins, are described in detail in Short 
20 Protocols in Molecular Biology, Second Edition, Ausubel 
et al . eds., John Wiley & Sons, 1992. The disclosures 
of Sambrook et al . and Ausubel et al . are incorporated 
herein by reference. 

Thus, a further aspect of the present invention 
25 provides a host cell containing nucleic acid as 

disclosed herein. A still further aspect provides a 
method comprising introducing such nucleic acid into a 
host cell. The introduction may employ any available 



WO 95/31540 



PCT/GB95/01104 



- 16 - 

technique. For eukaryotic cells, suitable techniques 
may include calcium phosphate transf ection, DEAE- 
Dextran, electroporation, liposome-mediated transf ection 
and transduction using retrovirus or other virus, e.g. 
5 vaccinia or, for insect cells, baculovirus . For 

bacterial cells, suitable techniques may include calcium 
chloride transformation, electroporation and 
transf ection using bacteriophage. 

The introduction may be followed by causing or 
10 allowing expression from the nucleic acid, e.g. by 

culturing host cells under conditions for expression of 
the gene . 

In one embodiment, the nucleic acid of the 
invention is integrated into the genome (e.g. 
15 chromosome) of a host cell. Integration may be promoted 
by inclusion of sequences which promote recombination 
with the genome, in accordance with standard techniques. 

Following expression, polypeptides may be caused or 
allowed to trimerise. This may be prior to or following 
20 isolation. 

The tightly associated trimer of a-helices found at 
the neck-region of SP-D is, to our knowledge, the first 
example of a self -assembling structural motif, 
C- terminal to a collagen triple-helical structure, which 
25 does not involve the formation of disulphide bridges. 
Also, our findings demonstrate (see example 1) that, 
although collagenous sequences of repeating Gly-Xaa-Yaa 
triplets require additional protein sequences for 
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inter-chain recognition, and association at their 
C- terminal ends, to initiate folding to an intact 
triple-helix, this association itself does not have to 
be in a staggered fashion in order to align the three 
5 chains in the correct register to form the staggered 
collagen helix. It seems, too, that the present 
invention may have advantages over prior art 
multimerising peptides which form dimers and tetramers 
in addition to trimers. 

10 The small size of this self -assembling domain as 

well as the lack of the requirement for disulphide 
bridges will allow particularly for the use of a 
u neck-region" peptide in the association and registered 
alignment of any collagenous polypeptide sequence, 

15 composed of Gly-Xaa-Yaa triplets,, irrespective of the 
origin of that collagenous sequence. It is feasible to 
use a neck- region peptide at the C- terminus of any 
collagenous polypeptide sequence to initiate the 
formation of collagen triple-helical conformation of the 

2 0 Gly-Xaa-Yaa triplets [3] . . However, the stability of 

collagenous structure also depends on the number of the 
triplets and the nature of the peptide structure, at the 
N-terminal end of the triple-helical region. Also, the 
hydroxylation of proline residues in Yaa. position has 

2 5 been shown to greatly enhance stability of the triple - 
helix [24] . Use of the neck-region peptide to initiate 
triple -helix formation enables the relative importance 
of these factors influencing the stability of 
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collagenous structure to be analysed. 

The present invention thus also provides the use of 
a polypeptide comprising or consisting essentially of 
the neck-region of a collectin, or an amino acid 
5 sequence variant thereof or derivative thereof, in a 
method of "seeding" of a collagenous triple-helix. 
Preferably, the polypeptide consists essentially of a 
series of collagenous triplets (Gly-Xaa-Yaa) fused at 
the C-terminus (possibly via a linker) to the "neck- 
10 region" of a collectin or amino acid sequence variant 
thereof or derivative thereof, or a sequence of amino 
acids having an amino acid pattern and/or hydrophobicity 
profile the same as or similar to the neck-region of 
collectin SP-D, with no amino acids C-terminal to the 
15 neck-region or heterologous amino acids C-terminal to 
the neck region. The "neck-region" (i.e. "first 
sequence of amino acids" as disclosed herein) may be at 
the C-terminus of the polypeptide, or there may be 
additional C-terminal amino acids, the proviso being 
20 that the polypeptide as a whole is non-naturally 
occurring . 

A method of seeding a collagenous triple-helix 
involves causing or allowing trimerization of such a 
polypeptide. It may involve first the production of the 
25 polypeptide by expression from encoding nucleic acid 
therefor. The present invention provides such nucleic 
acid, a vector comprising such nucleic acid, including 
an expression vector from which the polypeptide may be 
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expressed, and a host cell transfected with such a 
vector or nucleic acid. The production of the 
polypeptide may involve growing a host cell containing 
nucleic acid encoding the polypeptide under conditions 
5 in which the polypeptide is expressed. Systems for 
cloning and expression etc. are discussed supra. 

Trimerization may be followed by* isolation of 
trimers, e.g. for subsequent use and/or manipulation. 

As demonstrated experimentally herein neck- region 

10 peptide may be used to generate polypeptides with one or 
more amino acids carrying distinct properties, at either 
end of the neck-region, and, using hetero-trimerization, 
these properties can be combined with those carried by 
other domains in separately generated polypeptides 

15 containing the neck region. 

For example, a single -chain antibody may be 
trimerized by generating it as a fusion polypeptide with 
the neck-region in an expression system. Fusion of 
antibodies or fragments thereof, including scFv, may be 

20 directed against cell-surface molecules, such as CD8 , 
CD4 or TCR6, for instance. Trimeric molecules should 
have a higher avidity to their respective ligands than 
the monomeric forms without the neck- region. Using a 
mild heterotrimerization technique of heating (e.g. to 

25 about 50 °C) and cooling (e.g. ambient temperature) 

individual trimers may be dissociated and re-associated 
to yield heterotrimeric complexes. These complexes may 
carry a weaker affinity for each individual ligand, but 
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a strong avidity for an entity displaying two or more of 
the respective ligands. For example, trimerised anti- 
CD4 , anti-CD8 and anti y6 TCR scFv molecules would have 
strong affinity for an entity which is CD4, CD 8 and y5 
5 TCR positive. Thus, heterotrimers may be created with 
molecules with any combination of first, second and 
third binding specificities. 

Where specific recognition only involves one end, 
e.g. the C-terminal end, of the neck-region in the 
10 polypeptides, the other, e.g. N- terminal end, of the 
neck-region may be used for additional functions, e.g. 
in drug targeting or diagnostic detection. 

Further applications of homo- or 
15 heterotrimerizations may include use of any of the 
following: 

(i) peptide -ligands for receptors, especially 
low- affinity binding (e.g. neuropeptides, interleukins) . 

(ii) antigens . 

20 (iii) chemical compounds that are reactive upon ■ 

activation e.g. photo-activatable chemical crosslinkers 
that react with any molecule such as a protein, either 
specifically or generally, when close by. The neck- 
region peptide may be part of a trimer carrying a ligand 

25 with specificity for an unknown receptor; after specific 
binding to the receptor the crosslinker may be UV- 
activated and only molecules close to the ligand- 
receptor complex could be crosslinked and thus 
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identified, 

(iv) organic compounds e.g. caffeine, morphine 
(e.g. for research, diagnostic, or therapeutic use). 

(v) low affinity binding domains especially for 
5 the screening of potential inhibitors in pharmaceutical 

research. 

(vi) indicator molecules for pH, CaCl 2 
concentration or others relevant to diagnostics and 
research. 

10 (vii) carbohydrate binding domains. 

(viii) carbohydrates e.g. for binding and/or 
research on lectins. 

(ix) lipid-containing structures (these may be at 
the N- terminal end for incorporation into liposomes, 

15 e.g. containing an active molecule, and the trimerising 
polypeptides may have a specificity directing domain at 
'an end, e.g. the C-terminal end, of the neck-region. 

(x) DNA or RNA or derivatives (this may be 
useful when more than one protein shall .be directed to 

20 act at a specific site in e.g. a chromosome, or when 
simple chemical attachment of DNA to that effector 
enzyme affects its function. If the DNA -DNA interaction 
(DNA at one end of the neck- region and DNA in the 
chromosome) has a higher dissociation temperature than 

25 the neck-region peptide (which is very likely) then 
different functional polypeptides may be added 
subsequent to the initial DNA recognition. This may be 
used in a similar fashion as in-situ hybridizations, 
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where a fluorescent tag is added to the oligonucleotide, 
allowing the position of a gene on a chromosome to be 
visualized in the microscope. The neck-region DNA probe 
may therefore be hybridized to, e.g., the human 
5 chromosome at the position of a given gene at usually 
about 65-75 "C. The solution may then be cooled to about 
50 *C and unhybridized probes may be washed away. Then, 
still at about 50 °C, another neck-region polypeptide may 
be added, containing, for example, a DNA- cleaving 

10 polypeptide that would cleave anywhere if brought to 
suitable conditions e.g. addition of ATP and S- 
adenosylmethionine . The soluble, added, neck-region 
enzyme fusion protein and the 'immobilized' DNA-neck- 
region molecules may then allowed. to heterotrimerize by 

15 cooling the solution, and. after sufficient washing, the 
co- factors may be added. Now the enzyme would be active 
but would only cleave at the site of the hybridization. 
This is extremely useful, especially if the enzyme in 
question cannot stand the temperatures required to 

2 0 perform the DNA -DNA hybridization, but retains its 

activity when mildly heterotrimerized. Alternatively, 
the second neck-region fusion protein may (also) contain 
a purification- tag for the isolation of DNA that 
contains a specific DNA sequence. The system of 

2 5 specific DNA recognition with the delayed delivery to 
that recognized site of a functional protein domain may 
be used in other circumstances, such as in-situ 
hybridizations, genomic library construction (Human 



WO 95/31540 



PCT/GB95/01104 



- 23 - 

Genome Project) , in-vitro assays, or non- radioactive 
diagnostics . 

(xi) the neck-region may be attached to a solid 
matrix (this may be useful e.g. as a research tool to 
5 reversibly immobilize recombinant proteins which contain 
the neck-region) . Resin with the immobilized 
(preferably via N-terminus) neck-region may be mixed at 
about 25 °C at sub-physiological ionic strength with a 
recombinant neck-region fusion polypeptide e.g. 
10 containing a single-chain antibody, and 

heterotrimerized. Two single-chain antibody molecules 
may be bound per neck- region molecule on the resin, and 
oriented towards the solvent . The resin may then be 
used like a normal affinity matrix, but may be used 
15 again for a different molecule by releasing the single- 
chain antibody neck-polypeptides , for instance at about 
50 °C and recharging with a new neck-region peptide- 
containing molecule.) 

(xii) enzymes (especially enzymes of the same 
20 reaction pathway that are subsequently involved, such 
that the product of one reaction is the substrate for 
the next enzyme) . The close location of the enzymes may 
bring advantages due to short diffusion way and 
therefore the reduced likelihood of side-reactions. 
25 Also, the immobilization of the enzymes via the neck- 
region of one of the three polypeptide chains may bring 
advantages, such as the easy removal of the enzymes or 
the reaction on a column. This advantage may also be 
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gained by a heterotrimeric or homotritneric enzyme 
complex that may be removed from the rest of the 
solution by mixing it with an excess of neck-region- 
resin at about 50 *C, cooling, and removal of the resin. 
5 Applications may include the enzymes used in molecular 
biology, because the substrates and products of their 
actions are mainly DNA molecules which are very (thermo) 
stable . 

(xiii) cysteine residues may be added to either end 

10 or both ends of the sequence in order to generate a 

covalently linked trimer. The exact sequence containing 
the cysteines may be derived from the FACIT collagens, 
some of which are linked into trimers via disulphides 
immediately following the collagenous structure, thus 

15 allowing for a transfer of one of those sequences to the 
N- terminus of the neck-region. This may be of use to 
further increase the stability of the peptide timer 
without affecting the overall shape. 

The present invention will now be illustrated 

20 further by way of example and with reference to the 
following figures. 

Figure 1 (a) schematic drawing of the location of 
the neck-region peptide of human SP-D. Human SP-D 
consists of 12 identical polypeptide chains (each of 3 56 

25 amino acids) which assemble into 4 rod-like structures, 
each composed of three chains, which form triple -helical 
collagenous structures over residues 26 - 202. The 
C-terminal ends of the molecule contain C-type lectin 
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domains linked to the collagenous domains via the 
neck-region, whereas the N-terminus is involved in 
oligomerization of the trimers into a tetramer. The 
position of the neck-region within the human SP-D 
5 protein and the sequence of the neck-region peptide as 
expressed in Example 1. The /a' and 'd'. positions of the 
a-helical coiled-coil are indicated, (b) a-helical wheel 
drawing of the neck-region peptide. The drawing is down 
the helical axis beginning at the N-terminal valine 

10 residue at the 'a' position. 

Figure, 2 shows circular dichroism spectroscopy 
analysis of the thermal stability of the secondary 
structure of the collagenase-digested neck-region 
peptide. Quarz cuvettes containing the peptide solution 

15 (adjusted to give an OD reading at 210 nm of 1.0) were 
allowed to equilibrate for 15 min at each temperature 
selected. Circular dichroism spectroscopy profile of the 
collagenase-digested (dashed line) and the intact neck- 
region peptide, collected at 20°C, with both peptides in 

20 30 mM phosphate buffer pH 7.4, adjusted to give an UV 
absorption of 1 . 0 at 210 nm. The blank baseline was 
subtracted for both peptides and the resulting curves 
were overlaid. The spectra are almost identical and 
show a negative value of approximately -3 0 mdeg at the 

25 wavelength 207 nm, and of approximately -20 mdeg at the 
wavelength 224 nm, which is in good agreement with 
spectra expected from a-helical structures. 2(b); The 
curve shows a thermal transition at approximately 55 °C. 
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However, this transition occurs over a wide range of 
temperatures and more experimental data will be required 
to establish a more precise melting temperature for the 
neck-region peptide. 
5 Figure 3(a) shows the crosslinking agent, 

bis- (sulphosuccinimidyl) -suberate, used to react with 
amino-groups present within the neck- region peptide at 7 
residues (Lysines) , to form amide bonds, thus covalently 
linking polypeptide chains, spaced by 6 CH2 groups. 
10 Figure 3 (b) shows a schematic representation of 

trimerised neck region peptides (tubes) with seven 
collagenous amino acids (zig-zags) . 

Figure 4 shows SDS-PAGE analysis (Coomassie Blue 
R-250 stain of a 15 % (w/v) acrylamide 
15 tris-tricine-glycerol gel) of the purified 

collagenase-digested (lanes 5-8) and the intact 
neck-region peptides (lanes 1-4) , reacted with 
increasing amounts of Bis- (sulphosuccinimidyl) -suberate . 
0 mM (lanes 1 and 5) , 3 mM (lanes 2 and 6) , 5 mM (lanes 
20 3 and 7) , and 10 mM (lanes 4 and 8) of crosslinker were 
incubated for 20 min at 37°C with a constant (10 fig) 
amount of peptide in 4 0 /xl PBS before the samples were 
boiled in Tris -containing SDS-PAGE loading buffer, 
stopping the reaction. Both peptides* can be crosslinked 
25 into their respective trimeric complexes without the 
appearance of higher-order aggregates. The trimer of 
the collagenase-digested peptide runs at approximately 
16 kDa, whereas the neck -region peptide trimer shows a 
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molecular weight of approximately 22 kDa. 

Figure 5 shows a schematic representation of a 
crosslinking experiment, analysed by SDS-PAGE (see 
Figure 6) . The drawing in lane 1 show the un-crosslinked 
5 single polypeptide chains of both, the 

collagenase-digested and the intact neck-region peptide. 
Lane 2-5 represent crosslinking reactions, with lane 2 
showing the neck-region peptide analysed after a partial 
crosslinking reaction, showing monomeric, dimeric, and 

10 trimeric molecular weights. Lane 3 represents the same 
analysis for the collagenase-digested peptide, and lane 
4 indicated the expected result of a crosslinking 
reaction of both peptides when the solution is not 
heated and cooled before the crosslinking step. If the 

15 heating and cooling is carried out before the 

crosslinking reaction, heterotrimeric complexes should 
be detectable, showing intermediate molecular weights, 
as indicated in lane 5 . 

Figure 6 shows SDS-PAGE analysis (Coomassie Blue 

20 R-250 stain of a 15 % (w/v) acrylamide 

tris-tricine-glycerol gel) of the purified 
collagenase-digested and the intact neck-region 
peptides, involving crosslinked of the individual 
peptides following heating and cooling steps in PBS, and 

25 mixing of the two peptide species in different ratios 
(before heating and cooling) followed by crosslinking. 
Lane 1 shows both, the collagenase-digested and the 
intact neck-region peptide, 1 and 5 Mg respectively, 
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without crosslinker added. The neck-region peptide 
reacted with 5 mM Bis- (sulphosuccinimidyl) -suberate is 
shown in lanes 2 and 3, with the peptide solution 
incubated at 99°C for 20 min followed by chilling on ice 
5 before addition of crosslinker shown in lane 3 . The 

same reactions with the collagenase -digested peptide are 
shown in lanes 4 and 5, respectively. Peptides in lanes 
3 and 5 were subjected to heating and cooling. In lanes 
6-9 different amounts of the two peptides were mixed 

10 prior to heating and cooling, followed by crosslinking 
with 5mM crosslinker. The ratios (neck-region 
peptide : digested peptide) are 1:1 in lane 6, 4:1 in lane 
7 , 1:4 in lane 8 , and 2:1 in lane 9 . The banding piattem* 
indicates that firstly the heating and cooling did not 

15 alter the detection by crosslinking of a trimeric 
complex, secondly, as mixed complexes can be 
crosslinked, that the polypeptides of different 
complexes are dissociated at high -temperatures and that 
they re-anneal in mixed complexes, and thirdly, that 

20 these hetero- trimerizing reactions can be driven in a 
concentration-dependent manner. . 

Figure 7: The pGEX-2T-N- term-neck-region plasmid 
allows for the induction (with IPTG) of a 
glutathione- S-transf erase -N- terminus -neck -region fusion 

2 5 protein which can be cleaved with thrombin to yield the 
peptide sequence shown. (Lower case indicates pGEX 
polylinker sequence and the N- terminus sequence is 
underlined. ) The DNA construct was obtained by Smal and 
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Nrul digestion of the pGEX-2T-N- term-coll -neck-region 
plasmid and subsequent religation of the compatible 
sites . 

Figure 8 shows SDS-PAGE analysis (Coomassie Blue 
5 R-250 stain of a 15 % (w/v) acrylamide gel) of the 

purified N- term- neck -region fusion peptide, reacted with 
increasing amounts of crosslinker, under reducing (lanes 
4,5,6) and non- reducing (lanes 1,2,3) conditions. 
Peptides (50 fil) were incubated with 0 mM (lanes 3 and 

10 6) , 2 mM (lanes 2 and 5) , and 5 mM (lanes 1 and 4) 
Bis- (sulphosuccinimidyl) -suberate in PBS for 2 0 min 
before 10 pi 1 M Tris.Cl pH 8 . 0 was added to quench the 
reaction. When the reaction went almost to completion a 
protein species of approximately 2 9 kDa can be detected, 

15 whereas bands corresponding to dimeric (approximately 19 
kDa) and single chain N- term-neck-region peptide (9 kDa) 
were detected in incompletely or non-crosslinked 
reactions. Under non-reducing conditions approximately 
half of the peptide runs as a dimer, and increased 

2 0 amounts of dimers and trimers are seen compared to the 
reduced samples, in the crosslinking reactions. 

Figure 9 illustrates transfer of the 
neck-region- lectin and -lectin coding DNA fragments from 
the fusion protein generating pGEX-2T system (a) to the 

25 pET3a vector. The DNA fragments were excised using BamHl 
and EcoRl and subcloned into the pBluescript plasmid, 
which was linearized by using the same enzymes. After 
repurif ication of the resulting plasmids from 
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transfected NM554 cells the DNA fragments were excised 
using Xbal and EcoRV, thus generating compatible ends to 
the sites present in the pET3a plasmid cut with Nhel and 
EcoRV. The Nhel site is positioned at the start codon of 
5 the pET3a open reading frame. Therefore the proteins 
induced will not carry a fusion partner at the 
N- terminal end of their sequence, but only 7 amino acid 
derived from the pBluescript polylinker sequence: 
MARTSGS . 

10 Figure 10 shows SDS-PAGE analysis (Coomassie Blue 

R-25 0 stain of a 12.5 % (w/v) acrylamide gel) of the 
C-type lectin domain of human SP-D and the neck-region 
-lectin domain as purified from lysates of bacterial 
cultures induced to' express the recombinant proteins • 

15 using the pET-3a vector. Lanes 1 to 3 correspond to the 
lectin domain, and lanes 4-6 show the neck-region-lectin 
domain protein. The proteins were crosslinked using 
increasing final concentrations of 

Bis- (sulphosuccinimidyl) -suberate . Reactions of lanes 1 
2 0 and 4 contain 0 mM, lanes 2 and 5 correspond to 5 mM, 
and lanes 3 and 6 show the result of reactions with 10 
mM crosslinker. Although the neck-region-containing 
protein can be shown to consist of trimers that can be 
crosslinked, no such crosslinkable oligomers exist for 
25 the lectin domain expressed on its own. Thus, the 
neck-region can mediate trimerization whereas no 
self -associating properties could be detected for the 
lectin domain of human SP-D alone. 
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Figure 11: illustrates (a) size-exclusion 
chromatography of the neck-region peptide / neck- lectin 
protein before (A) and after heterotrimerization (B) 
using a Superose 12 column. The column was equilibrated 
5 and run in PBS at 0.3 ml/min and the UV detector was set 
at 280 nm (sensitivity of 0.02), Samples were injected 
using 100 ml loops. The neck-lectin trimeric complex 
elutes first, with the neck-region peptide trimer 
following. After heterotrimerization a shift in the 
10 elution profile of the first peak can be detected 

corresponding to trimeric protein complexes consisting 
of 2 neck-region peptides and 1 neck-lectin protein, 
(b) shows a schematic representation of the 
heterotrimerization process. The different polypeptides 
15 containing the neck- region are forming heterotrimers 
after a re-annealing step and this is a . 
concentration-driven process with the relative amounts 
of the two homotrimeric molecules used at the beginning 
determining the ratios of the products of the 
2 0 heterotrimerization reaction. 

Figure 12: The pGEX-2T-N-term-coll-neck-region 
plasmid (B) allows for the induction (with IPTG) of a 
glutathione- S-transf erase -N- terminus -coll -neck- region 
fusion protein which can be cleaved with thrombin to 
25 yield the 25 kDa SP-D polypeptide. The DNA construct 
was obtained by PCR using a 5' oligonuclotide 
introducing a BamHl site at the start of the N-terminal 
sequence and a 3' oligonucleotide introducing an EcoRl 
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site after the stop codon of the SP-D cDNA as primers, 
thus amplifying a DNA fragment coding for the entire 
protein sequence of the native SP-D protein (A) . 
Subsequently, the PCR product was cloned into 
5 pBluescript using the engineered enzyme sites and the 
resulting plasmid was digested with the restriction 
enzymes BamHl and Mscl and the DNA fragment coding for 
the N-tem-coll-neck-region was cloned into the pGEX-2T 
plasmid, linearized with BamHl and Smal. 

10 Figure 13 shows a schematic representation of the 

proposed folding process involved in the formation of 
the triple-helical collagen-like region of human SP-D. 
The neck- region associates as a parallel homotrimeric 
coiled-coil and provides thus a nucleation point for the 

15 formation of the triple-helical structure in the 

adjacent collagenous region. The collagen triple-helix 
forms in a zipper-like ' fashion from this nucleation 
point towards the N- terminal end of the polypeptide 
chain. There are numerous potential cleavage sites for 

2 0 Thrombin present within the collagenous sequence, 

however, the formation of a triple-helix would render 
these sites resistant to proteolytic digestion because 
the only known proteases to be able to digest 
collagenous structures are collagenases . 

25 Figure 14 shows (a) SDS-PAGE analysis (Coomassie 

Blue R-250 stain) of the 

Glutathione-S- trans f erase -N- term- collagen-neck- region 
fusion protein before and after thrombin cleavage (10 % 
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(w/v) acrylamide) . 50 fil solution of the eluted and 
glycine -treated fusion protein were loaded before (lanes 
1,2) and after thrombin digestion (Lanes 3-6). The 
close-up of lanes 5&6 (b) shows the less-intensely 
5 stained 25' kDa recombinant peptide running underneath 
the 25 kDa glutathione-S- transferase band. 

Figure 15: The pGEX-2T-neck-region-lectin and 
pGEX-2T-lectin plasmids were generated using either 
Smal and EcoRl (neck+lectin) or Mscl and EcoRl (lectin) 
10 to clone the SP-D cDNA fragments into the pGEX-2T 
plasmid, linearized with Smal and EcoRl. The fusion 
proteins induced are 43 kDa (neck+lectin) and 3 7 kDa 
(lectin) in expected molecular weight. 

Figure 16 shows SDS-PAGE analysis (10 % (w/v) 
15 acrylamide gel) of bacterial expression of the 

glutathione-S-transf erase fusion proteins containing the 
neck-region and the lectin region (A) and the lectin 
domain of human SP-D alone (B) . Non- induced (A lanes 2 
and 3, and B lanes 1 and 2) and induced bacteria (A lane 
2 0 1 and B lane 3) were boiled and loaded onto the gels, 
run under reducing conditions. Protein bound to 
maltose-TSK was eluted using EDTA, and peak samples (50 
/xl) loaded onto lanes 4 and 5, no protein eluted for the 
glutathione-S-transf erase-Lec fusion protein. The 
25 fusion protein containing both the neck-region as well 
as the lectin domain appears to be susceptible to 
proteolytic degradation within the cells, for a 
prominent protein band in lane A 1 is not only present 
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at the expected size of approximately 42 kDa, but also 
at approximately 30 kDa. However, this protein did not 
bind to the maltose-TSK column. The 

glutathione-S-transf erase-Lec fusion protein showed a 
5 molecular weight of approximately 37 kDa. 

Figure 17 shows a DNA construct encoding the 
immunoglobulin domains of the variable regions of the 
monoclonal rat anti-CD4 IgG antibody heavy and light 
chains (scabOX3 5) , fused together by a flexible linker 

10 sequence (a) . The DNA fragment was inserted into the 
Smal site of the pGEX-2T- neck -region peptide, resulting 
in an open reading frame for a fusion protein with the 
glutathione-S-transf erase and the neck-region. The DNA 
was also inserted into the pGEX-2T plasmid (b) , 

15 linearized with Smal, thus producing a fusion protein of 
scabOX35 with glutathione-S-transf erase alone. 

Figure 18 shows SDS-PAGE analysis (Coomassie Blue 
R-250 stain) of bacterial lysates of cultures induced to 
express glutathione-S-transf erase fusion proteins- with 

20 the OX3 5 -single -chain antibody (lane 1) the neck-region 
of human SP-D (lane 2) , and the 

neck-region-OX35-single-chain antibody (lane 3) , 
compared to non- induced bacteria (lane 4) , under 
reducing conditions (12.5 % (w/v) acrylamide) . 
25 All documents mentioned in the text are 

incorporated herein by reference. 

EXAMPLE 1 
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Trimerisation of a collectin neck region 

Figure 1(a) shows the structure of human SP-D. A 
168 bp DNA fragment, which encoded 7 Gly-Xaa-Yaa 
triplets and the 35 non-collagen like residues of the 
5 neck-region leading up to the C-type lectin domain. It 
was cloned into the pGEX-2T bacterial expression vector 
[9] and the correct orientation of the insert was 
checked by restriction digestion. High levels of 
expression of the glutathione -S-transf erase/neck-region 

10 peptide fusion-protein were obtained after 6 hours of 
.induction with IPTG. Thrombin digestion of the affinity 
purified fusion-protein resulted in two polypeptides, 
the glutathion-S-transf erase and the neck-region 
peptide, carrying an additional Gly-Ser-Pro triplet at 

15 the N- terminus and the residues Gly-Ile-Pro-His-Arg-Asp 
at the C-terminal end, representing the polylinker 
present in pGex-2T. 14 mg of the recombinant peptide 
were purified per litre culture in the three-step 
purification procedure. The peptide elutes in a single 

20 peak from the HighLoad-S column, at 800 mM NaCl . The 
purity of the peptide was confirmed by SDS-PAGE 
analysis, N- terminal sequencing of residues 1-46, and 
laser desorption mass spectroscopy (data not shown) . 
To determine the secondary structure of the 

25 peptide, far-ultraviolet CD measurements were carried 
out on the collagenase- treated neck-peptide (Figure 
2 (a) ) . The spectra show a strong positive value at 193 
nm with two negative values at 208 and 223 nm, 
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consistent with the expected profile of a-helical 
structure [10] . The structure disappears reversibly 
with increasing temperature and a thermal unfolding 
transition at 55 °C was observed. 
5 As the location of the neck-region within the SP-D 

protein suggests a parallel orientation of the oe-helices 
and the amino acid sequence of the peptide contains 
hydrophobic residues, in a repeating heptad pattern, the 
three of-helices could associate in a coiled-coil with 

10 the hydrophobic residues forming the interface between 
the helices (Figure Kb)) [11]. 

Using size exclusion chromatography, under 
non-dissociating conditions, the 65- residue-long 
peptide run as a single peak having an apparent 

15 molecular weight of 21-24 kDa. SDS-PAGE analysis showed 
single chain size of 6 kDa, however upon reaction with a 
cross-linking reagent, a single protein species of 21 
kDa was detected when the reaction went to completion, 
while protein bands corresponding to 6, 13, and 21 kDa 

20 were seen in partially cross-linked reactions (Figures 3 
and 4) . Higher oligomers were never seen. Thus, the 
region expressed is sufficient to form a trimer. 

In order to determine if the 7 Gly-Xaa-Yaa triplets 
at the N-terminal third of the peptide made any 

25 contribution to the formation of the trimer, collagenase 
digestion was carried out, and the molecular weight of 
the resulting peptide was reduced to 4 kDa. It was 
shown, by N-terminal sequencing, that all the collagen 
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triplets had been removed. This did not, however, 
reduce the ability of the remaining peptide to form 
stable trimers in solution (Figure 4) . 

Both peptides were also found to re-assemble into 
5 trimeric complexes even after heat -denaturat ion (98 °C 
for 20 min in phosphate buffered saline) and mixing 
varying portions of collagenase digested and intact 
neck -peptide followed by heat -denaturat ion and cooling 
resulted in hetero-trimerization to complexes in the 
10 expected stochiometric amounts (Figures 5 and 6) . 

Therefore, the O terminal 35 residues were sufficient to 
mediate the stable non-covalent reversible association 
into trimeric complexes . 

In order to obtain a complete structure 
15 determination of the neck-region peptide heteronuclear 
single quantum coherence '(^"H, 15 N) NMR spectra on 15 N 
labelled peptide were collected and showed only one 
magnetic environment for each residue. As the peptide 
exists as a trimer and as each residue within any one of 
2 0 the three a-helices shows the same magnetic environment 
as the corresponding residues in the other chains the 
structure of the a-helical bundle must have a 3 -fold 
symmetry. Thus, the neck-peptide assumes the same 
oligomeric structure as the trimeric stalk of influenza 
25 hemagglutinin [13] , but, unlike the virus stalk region 
peptide, the SP-D peptide formed a trimeric structure 
over a wide range of pH (3.0 - 9.5). The 3 -fold 
symmetry observed proves the non- staggered and parallel 
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association of the three helices and is contrasted by 
the staggered alignment of ant i -parallel helices 
demonstrated recently for the spectrin molecule [14] . 
Surprisingly, therefore, the association of three 
5 right-handed a-helices in a parallel and non- staggered 
left-handed superhelix can serve as the nucleation site 
for the formation of a right-handed collagen superhelix 
of left-handed helices. As the three polypeptide chains 
involved are identical and as the collagen helix and the 
10 of-helical bundle are positioned in a direct junction 

this region of the SP-D molecule should contain a sharp 
bending of the peptide structure. 

EXAMPLE 2 

15 Trimerization of the N- terminal domain of human SP-D by 
fusion to the N- terminus of the neck-region peptide 

The expression plasmid for this fusion peptide was 
generated by removing the DNA segment coding for the 
collagenous region of human SP-D from 10 /xg of the 

2 0 original cDNA containing plasmid by restriction enzyme 
digestion with 4 units of Nrul in buffer Nrul (New 
England Biolabs) and subsequently with 5 units of Smal 
in buffer J (Promega) at 25 °C for 4 hours, excising the 
456 bp fragment. The remaining plasmid was purified 

25 using the magic miniprep DNA purification resin 
(Promega) , re-ligated, and transformed into the 
competent cells of the BL21 bacterial strain of E.coli. 
The polymerase chain reaction was used to generate a 
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BamHl restriction enzyme site at the N-terminal end of 
the N-terminal domain of SP-D. The resulting PCR product 
was cleaved with BamHl and Ball to result in an open 
reading frame coding for 84 amino acids (28 of the 
5 N-terminal domain and 5 6 of the neck- region including a 
7 triplet collagenous linker between the two domains) . 
The polypeptide was generated as a 

glutathione-S-transf erase-N- term-neck -region fusion 
protein, illustrated in Figure 7. 

10 Individual colonies of BL21 carrying the 

recombinant plasmid, pGex-2T-N- term-neck, were 
identified to express a recombinant fusion protein of 
the expected (34 kDa) size after induction of protein 
expression with IPTG (see Example 1) . Large scale 

15 protein production was performed roughly as described 
above for the neck-region peptide, due to a similar 
behaviour of the N-terminal-neck-region protein on 
Highload S anion exchange chromatography. 

Briefly, protein expression was induced using IPTG. 

2 0 The cells of 6 1 bacterial culture were harvested by 
centrif ugation at 5k rpm at 0°C and resuspended in a 
buffer consisting of 100 mM Tris.Cl, pH 8.0, 200 mM 
NaCl, 20 mM EDTA, and the cells were lysed by 
sonnication for 2 minutes on ice. Cell debris was spun 

25 down at 19k rpm for 30 minutes at 0°C and the 

supernatant was applied to a glutathione-agarose 
affinity column, equilibrated in the lysis buffer. The 
resin was washed using the lysis buffer containing 0.2% 
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(w/v) Emulphogen (polyoxyethylene-10-tridecyl ether) 
until the absorbance at 280 nm reached the starting 
value again. Bound peptides were eluted using 20 raM 
glutathione (reduced form) in lysis buffer and thrombin 
5 digestion was carried out in this buffer at 37°C for 10 
hours by adding 10 units of thrombin per mg of fusion 
peptide. The pH was then adjusted to 3.0 by adding a 1 
M sodium citrate buffer, and subsequently 1 M HC1, to 
result in a 100 mM citrate buffered solution of pH 3.0. 

10 At this stage, a white precipitate, containing 

glutathione-S-transf erase, was removed by • centrif ugation 
(19k rpm at 0°C for 3 0 min) and the supernatant was 
applied to a Pharmacia HighLoad S column for anion 
exchange chromatography using a Waters FPLC system. The 

15 N-terminus-neck-region peptide eluted at 450 mM NaCl in 
a single, symmetrical peak and was shown to be free of 
contaminating proteins as judged by SDS-PAGE analysis 
and Coomassie blue R-250 staining. The purified peptide 
was dialysed against PBS and concentrated using 3 kDa . 

20 cut-off centricon cartridges. A 25 ml solution of the 
peptide in PBS of a final concentration of 1 mg/ml was 
recovered. This represents a yield of 4 mg/1 bacterial 
culture of recombinant proteolytically processed 
peptide. The purity of the peptide as well as its size 

25 was determined by SDS-PAGE analysis to have a molecular 
weight of approximately 9 kDa. 

Crosslinking experiments were conducted as 
described above for the neck- region peptide alone, and 
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the results are shown in Figure 8. In the absence of the 
covalent crosslinker the peptide behaves as a single 
polypeptide species of ca. 9kDa, however, under 
crosslinking conditions additional bands are visible, of 
5 18 kDa and ca. 2 9 kDa, corresponding to the dimeric and 
trimeric polypeptides in partially crosslinked 
complexes, and to the completely crosslinked trimeric 
complex present in solution. The neck-region has thus 
trimerized a heterologous protein domain which was fused 
10 to the N-termini>s of the neck-region sequence. 

EXAMPLE 3 

Trimerization of the C-type lectin domain of human SP-D, 
positioned at the C- terminal end of the neck-region 

15 In addition to these studies a protein expression 

system was employed which generates non- fusion 
polypeptides, thus giving a more accurate picture of the 
trimerizing features of the neck-region peptide. 

The pET series of expression vectors (Studier and 

20 Moffat, 1986) can be used to generate high-level 
intracellular production of non- fusion proteins in 
E.coli using IPTG as an inducer of protein expression. 
In order to use the plasmid pET 3a for the production of 
SP-D-neck-lectin and SP-D-lectin proteins the DNA 

2 5 inserts coding for the SP-D derived polypeptides were 
excised from the pGex-2T vectors using the restriction 
enzymes BamHr and EcoRl and the resulting fragments were 
ligated into the chloramphenicol -resistancy carrying 
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version of the pBluescript plasmid, pBCSK, * linearized 
with BamHl and EcoRl, and phosphatase- treated. The 
resulting plasmids, identified to contain the respective 
inserts using the magic miniprep method and restriction 
5 enzyme digestions, were digested with the restriction 
enzyme combination Xbal and EcoRV. This generated 
fragments with compatible ends to the pET-3a plasmid 
digested with Nhel and EcoRV. Thus, both DNA fragments, 
coding for the neck-region and the lectin domain or only 

10 the lectin domain of SP-D were transfered from a fusion 
protein generating expression system to the pET system 
in a 2 -step procedure. The additional residues, 
introduced at the N- terminal end of each of the 
constructs, were considered to be of minor influence, 

15 since they are unlikely to influence oligomerization of 
the recombinant proteins or binding to carbohydrate 
structures (Figure 9) . Both polypeptides were induced 
using IPTG and purified after lysis of the cells by 
conventional FPLC chromatography. Since both of the 

2 0 recombinant polypeptides were found to bind to FastFlow 
Q-Sepharose (Pharmacia) at pH 9.0 during the first step 
of purification, subsequent further purification was 
achieved on MonoQ (lectin domain) and MonoS (neck-lectin 
domain) columns (Pharmacia) . 

25 The purified proteins were dialysed against PBS at 

4°C and samples (50 fil) were analysed on 12.5 % SDS-PAGE 
gels. Bis- (sulf osuccinimidyl ) -suberate amino-reactive 
crosslinker was added at 2 different concentrations and 
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the protein bands detected in lanes corresponding to 
these samples revealed that the two proteins differed in 
respect to their oligomeric status (Figure 10) . The 
C-type lectin domain was found to behave like a 
5 monomeric protein in solution whereas the neck-lectin 
domain showed the expected crosslinking pattern of a 
trimeric molecule in solution. Thus, the neck-region 
mediates the trimerization of the C-type lectin domain 
of human SP-D which is -shown to form a monomeric 

10 molecule without the neck-region. The trimerized lectin 
domains were also found to bind more strongly to the 
affinity matrix maltose - agarose , whereas the monomeric 
lectin domain (without the. neck-region) showed a weaker 
affinity (data not shown) . 

15 The two proteins expressed may provide valuable 

tools for the study of native carbohydrate ligands for 
human SP-D, since the three lectin domains in the 
neck-lectin molecule are expected to have the. same 
spacing of their binding sites for carbohydrates as the 

20 lectin domains present in a single 'rod' of native human 
SP-D. These results indicate also that heterologous 
protein domains, fused to the neck-region peptide, will 
form trimeric complexes . 

20 /zg of recombinant neck-lectin polypeptide were 

25 mixed with 40 of neck-region peptide at 25 degree C. 
The mixture was analysed using a FPLC Superose 12 
(Pharmacia) size exclusion chromatography column, 
equilibrated in PBS. The remaining solution was heated 
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to 50 degree C for 30 min and subsequently left to cool 
to room temperature over a 20 min period. 100 M l of the 
solution were then analysed on the same Superose 12 
column . 

5 The elution profiles of both runs are illustrated in 

figure 11. Two distinct peaks corresponding to the sizes 
of the respective homotrimers of neck-region peptide and 
neck-lectin polypeptide were detected in the first run 
(figure 11 A a) ) , whereas the profile of the heat- 
10 treated peptide mixture had changed (figure 11 a b) ) . 
The first peak corresponding to the neck-lectin 
homotrimer shifted to a later elution time, 
corresponding to a smaller size. The second peak, caused 
by the neck-region peptide homotrimer remained at its 
15 original position, but was reduced in -bight, indicating 
a reduced amount of neck-region peptide homotrimer. The 
shifted first peak was found in crosslinking experiments 
(data not shown) to consist of 2 neck-region peptides 
and 1 neck-lectin polypeptide, held together as a 
20 heterotrimeric complex. 

Therefore a heterotrimerization had occurred via the 
neck region* s-a-helices which are contained within the 
sequences of both polypeptides. The large molecular 
excess of neck-region peptide homotrimers at the 
• 25 beginning of the heterotrimerizing experiment has driven 
the reaction to yield only two trimeric complexes, 
namely the neck-region peptide homotrimer and a single 
species of heterotrimer with one neck-lectin polypeptide 
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and two neck-region peptides. An increased concentration 
of the neck-lectin homotrimer at the beginning of the 
experiment drives the reaction to the other extreme, 
resulting in neck-lectin homotrimers being re-formed and 
5 a single species of heterotrimeric complexes consisting 
of two molecules neck-lectin polypeptide and one neck- 
region peptide (data not shown) . 

EXAMPLE 4 

10 Nucleation of collagen triple-helix formation 

Two DNA constructs were made to generate fusion 
proteins with glutathione-S- transferase (see example 1) : 
the neck-region peptide with 57 triplets (Figure 12) and 
the N- terminal non- collagenous residues of human SP-D, 

15 and 48 Gly-Xaa-Yaa triplets derived from human SP-D 
without the neck-region peptide fused directly to the 
glutathione-S-transf erase . Both fusion proteins were 
generated and purified using the protocol outlined in 
Example 1 . 

2 0 Prior to the cleavage with thrombin the fusion 

proteins were diluted tenfold in 2 M glycine buffer at 
pH 7.5 and subsequently dialysed extensively against 
100 mM Tris.HCl (pH 7.4) 200 mM NaCl . The thermal 
stability of polypeptides in solution may be greatly 

25 enhanced by the addition of 2 M glycine [24] . 

Upon digestion with thrombin and subsequent 
SDS-PAGE analysis marked differences were seen in the 
sizes of the cleavage products. The fusion protein 
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consisting of the glutathione-S- transferase and the 48 
Gly-Xaa-Yaa triplets only gave rise to a large number of 
peptides of different length, reflecting the frequently 
occurring cleavage by thrombin of peptide bonds 
5 involving arginine residues within the collagenous 
sequence (data not shown) . In contrast, the 
glutathione-S-transf erase fusion protein containing the 
neck-region peptide C-terminal to the 57 Gly-Xaa-Yaa 
triplets and the N-terminal non- collagenous peptide from 

10 human SP-D showed only a single cleavage into two 

products, the glutathione -S-transf erase and the entire 
collagenous region with the neck-region peptide and the 
N-terminal peptide of human SP-D attached (Figure 14) . 
As the 4 8 Gly-Xaa-Yaa triplets of the first construct 

15 were contained in the 57 Gly-Xaa-Yaa triplets of the 
second construct, the absence of thrombin cleavage at 
any of the arginine residues is consistent with the 
presence of collagen triple-helical structure. 

The formation of a collagen triple-helix (Figure 

20 13) can be detected by circular dichroism [25], 

multi- dimensional NMR [26] , and electron microscopy 
[27] . 

The involvement of the N-terminal non-collagenous 
region of human SP-D in the observed increase of 
25 stability of the triple-helix can be examined. Thus, as 
the natural occurrence of coiled-coils at the N-terminal 
end of collagenous sequences is seen in the macrophage 
scavenger receptor, short peptide sequences are attached 
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to N- terminal end of the collagenous regions N- terminal 
of the neck-region peptide using protein engineering 
techniques. This involves the use of polymerase chain 
reactions with synthesised oligonucleotides and 
5 subsequent ligations into the fusion-protein 

combining- site in the pGEX-2T vector, already carrying 
the DNA encoding the neck -region (see example 1) . 

The resulting set of purified recombinant peptides 
is tested for correct alignment of triple-helical 

10 peptides using amino -reactive chemical 

crosslinking-reagents in combination with SDS-PAGE and 
size-exclusion chromatography. A few suitable peptides 
are then 15 N-isotope labelled and analysed for thermal, 
stability using multi-dimensional NMR. At this stage 

15 the influence of the triplet number on the . melting 
temperature is examined by subsequent insertion of 
longer stretches of collagen-coding DNA in a similar 
fashion. 

20 EXAMPLE 5 

Increased binding of C-type lectin domain of SP-D when 
trimerised using neck region peptide 

The pBluescript plasmid containing cDNA coding for 
human SP-D protein was digested with the restriction 
25 enzymes Smal and EcoRl, as well as with Mscl and EcoRl, 
giving rise to DNA fragments of 532 bp, and 364 bp, 
respectively. Both fragments were subcloned into the 
pGex-2T expression vector, linearized with the 
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restriction enzymes Smal and EcoRl . This directional 
cloning procedure produced two expression plasmids, 
pGex-2T-neck-lectin and pGex-2T-lectin, which were 
transformed into the E.coli BL21 strain. Protein 
5 expression was induced with IPTG and clones identified 
to produce recombinant proteins of the expected size, 
i.e. 43 kDa for pGex-2T-neck-lectin and 37 kDa for 
pGex-2T-lectin (Figure 15) , were used to start 
large-scale preparation of glutathione -S-transf erase 

10 containing fusion proteins, as described above. 

Following cell lysis by sonnication and removal of 
the cell debris by centrif ugation, a portion of the 
supernatant (corresponding to 100 ml of the original 
bacterial culture) was made 5 mM in respect to 

15 unchelated calcium, and the solution was diluted 

ten-fold in 100 mM Tris.Cl, 150 mM NaCl , 5 mM CaCl 2/ 1 
mM NaN 3 , pH 7.5. The resulting solution was dialysed 
against 10 1 of the same buffer, at 4°C overnight, and 
after removal of some additionally formed precipitate by 

20 centrif ugation the protein solution was passed, at 10 
ml/hour, over a maltose-agarose column, equilibrated in 
the same buffer. The column was washed with 50 ml of 
the buffer and bound proteins were eluted using 20 mM 
EDTA in 100 mM Tris.Cl, 150 mM NaCl, pH 7.5. 

25 Samples (50 /xl) were analysed on 12.5 % SDS-PAGE 

gels, but only the pGex-2T-neck- lectin encoded protein 
of approximately 43 kDa could be purified in this way. 
Apparently, the fusion protein containing only the 
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C-type lectin domain fused to the glutathione-S- 
transferase was not able to bind to the immobilized 
maltose under the conditions used (Figure 16) . This 
observation is consistent with the suspected role of the 
5 neck-region peptide in bringing together three identical 
polypeptide chains, in a parallel orientation, and thus 
enhancing the binding properties of the adjacent C-type 
lectin domains . 

10 EXAMPLE 6 

Trimerization of a single -chain antibody 

Total RNA prepared from the hybridoma cell line 
OX35 [15] was used as a template for cDNA-PCR [16] to 
generate DNA encoding the variable region of the light 

15 and heavy chain of the anti-CD4 monoclonal antibody 
secreted by the hybridoma cells. Both fragments, of 
400bp length, were cloned into the pBluescript SK vector 
(Stratagene) and subsequently combined using a synthetic 
DNA fragment encoding the semi-rigid linker-peptide 

20 (GGGGS) 3 , which ensured the correct pairing of the V L 
and H L domains [17] . The sequence of the construct was 
confirmed by dideoxy sequencing of the DNA. 

The DNA fragment encoding for the two variable IgG 
domains, linked by the GGGGS- linker, was then cloned 

25 into the pGEX-2T vector, already carrying the 

neck-region peptide gene (see example 1) , to give rise 
to a fusion protein, GT-OX3 5-scAb-neck, (of an expected 
molecular weight of 55 kDa) consisting of the 
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single-chain antibody, as well as the 

glutathione-S-transf erase and the neck-region peptide. 
The single-chain antibody encoding DNA was also cloned 
into the pGEX-2T vector alone, without the neck-region 

5 peptide gene, giving rise to an open reading frame 
coding for the OX35-scAb fused to the glutathione -S- 
transferase (resulting in a fusion protein of 50 kDa) . 
Both expression vectors constructed were transformed 
into E.coll BL21 cells (see Figure 17). 

L0 Expression levels were found to. be similar to that 

obtained with the neck-region peptide alone (Example 1) 
(see Figure 18) . However, following the purification 
protocol, as outlined in example 1, most of the fusion 
protein was found to be insoluble and only a small 

15 proportion of the single-chain antibody fusion proteins 
could be solubilized and purified using a 

glutathione -agarose affinity column. Upon cleavage with 
thrombin fragments of the expected size were detected 
using SDS-PAGE analysis. Minor amounts of smaller 
20 fragments were also seen. 

The single chain antibody containing polypeptides 
were purified and on SDS-PAGE analysis had an apparent 
molecular weight of 25 kDa for the OX35-scAb and 3 0 kDa 
for the OX35-scAb-neck polypeptides, however, using 
25 size-exclusion chromatography, an increased apparent, 
molecular weight was observed for OX35-scAb-neck, 
whereas the OX3 5-scAb showed the expected behaviour of a 
2 5kDa polypeptide on the size exclusion column. 
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Chemical cross -linking experiments, as well as 
sucrose density centrif ugation analysis, determine the 
oligomeric status of the OX3 5-scAb-neck polypeptide. 
Estimation based only on the results obtained in the gel 
5 filtration analysis provide indication of the presence 
of a trimeric molecule. Affinity measurements with the 
trimeric and monomeric scAbs using immobilized 
recombinant CD4 [18] in ELISA and BiaCore Plasmon 
Resonance [19] analysis may be carried out. A 

10 substantial improvement of yield and. structural 

uniformity of the expressed antibody constructs may be 
obtained by following established protocols for the 
purification of recombinant proteins from bacterial 
inclusion bodies [20] or the use of a yeast expression 

15 system, known to facilitate expression of disulphide 
containing molecules [21] . 
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CLAIMS 

1. A non-naturally occuring polypeptide comprising a 
first sequence of amino acids which is a neck-region of 
a collect in, or an amino acid sequence variant thereof 

5 or a derivative thereof, able to form a trimer. 

2 . A polypeptide according to claim 1 wherein the 
first sequence of amino acids is the neck-region of 
collectin SP-D or a variant or derivative thereof. 

3 . A polypeptide according to claim 2 wherein the 

10 first amino acid sequence is the neck -region amino acid 
sequence shown in Figure 1 . 

4 . A polypeptide according to claim 1 wherein the 
first amino acid sequence is the neck region of 
collectin-43 or conglutinin, or a variant or derivative 

15 of one of these . 

5 . A polypeptide comprising a first sequence of amino 
acids having an amino acid pattern and/or 
hydrophobicity profile the same as or similar to the 
neck region of collectin SP-D, able to form a trimer. 

20 6. A polypeptide according to any one of claims 1 to 
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5 wherein said first sequence of amino acids is fused 
to one or more heterologous amino acids. 

7. A polypeptide according to claim 6 wherein the 
heterologous amino acids comprise a protein domain* 

5 8. A polypeptide according to claim 6 or claim 7 
wherein the heterologous amino acids comprise a 
sequence derived from an immunoglobulin. 

9. A polypeptide according to any one of claims 6, 1 
or 8 wherein the first amino acid sequence is joined to 

10 heterologous amino acid or acids via a peptide linker. 

10 . A polypeptide according to any one of claims 6 to 

9 wherein the heterologous amino acid is or 
heterologous amino acids comprise an amino acid which 
is derivatisable for attachment of a chemical moiety. 

15 11. A polypeptide according to any one of the 

preceding claims joined to a non-peptide moiety. 

12 . A polypeptide according to any one of claims 6 to 

10 comprising said heterologous amino acid(s) N- 
terminal to the first sequence of amino acids and one 

20 or more amino acids C-terminal to the first sequence of 
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amino acids, or said heterologous amino acid(s) C- 
terminal to the first sequence of amino acids and one 
or more amino acids N-terminal to the first sequence of 
amino acids . 

5 13 . A polypeptide according to any one of claims 1 to 
12 comprising a collectin C-type lectin domain. 

14 . Nucleic acid comprising a sequence of nucleotides 
encoding a polypeptide according to any one of the 
preceding claims . 

10 15. Nucleic acid according to claim 14 which is a 
vector. 

16. A host cell containing nucleic acid according to 
claim 14 or claim 15. 

17. .Nucleic acid according to claim 14 or claim 15 
15 wherein the encoding sequence is operably linked to a 

regulatory sequence for expression of the polypeptide. 

18. A host cell containing nucleic acid according to 
claim 17. 

19. A method comprising expression from nucleic acid 
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according to claim 14 of the encoded polypeptide. 

20. A method comprising culturing a host cell 
according to claim 18 under conditions for expression 
of said polypeptide. 

5 21. A method comprising forming a trimer comprising a 
polypeptide following its expression according to the 
method of claim 19 or claim 20.- 

22. A method according to claim 21 wherein said trimer 
is a homotrimer. 

10 23. A method according to claim 21 wherein said trimer 
is a heterotrimer . 

24. A method comprising isolation of a polypeptide 
following its expression according to the method of 
claim 19 or claim 20. 

15 25. A method comprising forming a trimer comprising a 
polypeptide following its isolation according to a 
method of claim 24 . 

26. A method according to claim 25 wherein said trimer 
is a homotrimer . 
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27. A method according to claim 25 wherein said trimer 
is a heterotrimer . 

28. A method comprising forming a trimer comprising a 
polypeptide according to any one of claims 1 to 13. 

5 29. A trimer comprising a polypeptide according to any 
one of claims 1 to 13, 

30. A trimer according to claim 29 which is a 
homotrimer . 

31. A trimer according to claim 2 9 which is a 
10 heterotrimer. 

32. A method of forming a collagenous triple helix 
comprising providing non-naturally occurring 
polypeptides, each polypeptide comprising a series of 
collagenous triplets N-terminal to a first sequence of 

15 amino acids which is a neck-region of a collectin, or 
an amino acid sequence variant thereof or a derivative 
thereof, able to form a trimer and causing or allowing 
said polypeptides to form trimers . 

33. A method according to claim 32 wherein the first 
20 sequence of amino acids is the neck-region of collectin 
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SP-D or a variant or derivative thereof. 

34. A method according to claim 33 wherein the first 
amino acid sequence is the neck- region amino acid 
sequence shown in Figure 1 . 

5 35. A method according to claim 32 wherein the first 
amino acid sequence is the neck region of collectin-43 
or conglutinin, or a variant or derivative of one of 
these . 

36. A method according to any one of claims 32 to 35 
10 wherein said first sequence of amino acids is either at 

the C- terminus of the polypeptide or said polypeptide 
comprises one or more heterologous amino acids C- 
terminal to said first sequence. 

37. A method of forming a collagenous triple helix 
15 comprising providing non-naturally occurring 

polypeptides, each polypeptide comprising a series of 
collagenous triplets N-terminal to a first sequence of 
amino acids which has an amino acid pattern and/or 
hydrophobicity profile the same as or similar to the 
20 neck region of collectin SP-D, able to form a trimer, 
and causing or allowing said polypeptides to form 
trimers . 
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38. A method according to any one of claims 32 to 37 
wherein said polypeptides are provided by expression 
from encoding nucleic acid therefor. 

39. A method according to claim 3 8 wherein said 
5 trimers are isolated following trimerisation. 
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